Greedy Coordinate Gradient

mentions 1 type Person feed RSS

// recent coverage 1 mentions

04:00

2026-05-28

arxiv.org

artificial-intelligence

Cross-Entropy Games and Frost Training

Researchers introduced Frost Training, a method that improves Monte Carlo-based policy optimization for Cross-Entropy Games by exploiting the gradient of the reward function in embedding space. The te…

// co-occurs with top 5 entities

Frost Training 1 Cross-Entropy Games 1 GRPO 1 GCG 1 LLM-as-a-judge 1